Understanding Web Archiving Services and Their (Mis)Use on Social Media
Abstract
Web archiving services play an increasingly important role in today’s information ecosystem, by ensuring the continuing availability of information, or by deliberately caching content that might get deleted or removed. Among these, the Wayback Machine has been proactively archiving, since 2001, versions of a large number of Web pages, while newer services like archive.is allow users to create on-demand snapshots of specific Web pages, which serve as time capsules that can be shared across the Web. In this paper, we present a large-scale analysis of Web archiving services and their use on social media, shedding light on the actors involved in this ecosystem, the content that gets archived, and how it is shared. We crawl and study: 1) 21M URLs from archive.is, spanning almost two years; and 2) 356K archive.is plus 391K Wayback Machine URLs that were shared on four social networks: Reddit, Twitter, Gab, and 4chan’s Politically Incorrect board (/pol/) over 14 months. We observe that news and social media posts are the most common types of content archived, likely due to their perceived ephemeral and/or controversial nature. Moreover, URLs of archiving services are extensively shared on “fringe” communities within Reddit and 4chan to preserve possibly contentious content. Lastly, we find evidence of moderators nudging or even forcing users to use archives, instead of direct links, for news sources with opposing ideologies, potentially depriving them of ad revenue.
Similar resources
Entity-Based Opinion Mining from Text and Multimedia
Social web analysis is all about the users who are actively engaged and generate content. This content is dynamic, reflecting the societal and sentimental fluctuations of the authors as well as the ever-changing use of language. Social networks are pools of a wide range of articulation methods, from simple “Like” buttons to complete articles, their content representing the diversity of opinions...
Social Media in Public Libraries: Recognition of Applications, Obstacles and Problems of Use
Background and Aim: Because of its interactive nature and the fact that it is free of charge, social media is widely used in libraries. Web 2.0 is a tool that offers a permanent connection and educational programs without limitations of place and time. But what is included in social media application in public libraries and what obstacles and problems are there in the way...
Stories From the Past Web
Archiving Web pages into themed collections is a method for ensuring these resources are available for posterity. Services such as Archive-It exist to allow institutions to develop, curate, and preserve collections of Web resources. Understanding the contents and boundaries of these archived collections is a challenge for most people, resulting in the paradox of the larger the collection, the ...
WADL 2016 Panels: Worldwide activities on Web archiving; Social media, Web archiving, and digital libraries
In addition to presentations based around scholarly papers, the Web Archiving and Digital Libraries 2016 workshop also featured two panel discussion sessions. Each panel centered on a theme and featured short presentations by the panelists, followed by a moderated discussion and interaction with workshop participants; the panels are described herein. 1. PANEL 1: WORLDWIDE ACTIVITIES ON WEB ARCH...
A Survey of Librarians' Perspectives on Marketing Library Services Using Social Media in Tehran, Iran, and Shahid Beheshti Universities of Medical Sciences
Background and Aim: The present study has examined librarians' views on the marketing of library services using social media as well as the applications, benefits, and challenges of their use in Tehran, Iran, and Shahid Beheshti Universities of Medical Sciences. Materials and Methods: This research was a descriptive and applied survey and was conducted in 2019. The data collection tool was a ...
Journal: CoRR
Volume: abs/1801.10396
Publication date: 2018